Geodesic Gaussian kernels for value function approximation

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Geodesic Gaussian kernels for value function approximation

The least-squares policy iteration approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in real-world reinforcement learning tasks. In this paper, we propose a new basis function based on ...

متن کامل

Geodesic Distance-based Kernel Construction for Gaussian Process Value Function Approximation

Finding accurate approximations to state and action value functions is essential in Reinforcement learning tasks on continuous Markov Decision Processes. Using Gaussian processes as function approximators we can simultaneously represent model confidence and generalize to unvisited states. To improve the accuracy of the value function approximation in this article I present a new method of const...

متن کامل

Nonlinear Approximation Using Gaussian Kernels

It is well-known that non-linear approximation has an advantage over linear schemes in the sense that it provides comparable approximation rates to those of the linear schemes, but to a larger class of approximands. This was established for spline approximations and for wavelet approximations, and more recently for homogeneous radial basis function (surface spline) approximations. However, no s...

متن کامل

On Dimension-independent Rates of Convergence for Function Approximation with Gaussian Kernels

This article studies the problem of approximating functions belonging to a Hilbert space Hd with an isotropic or anisotropic translation invariant (or stationary) reproducing kernel with special attention given to the Gaussian kernel Kd(x, t) = exp ( − d ∑ `=1 γ ` (x` − t`) 2 ) for all x, t ∈ R. The isotropic (or radial) case corresponds to using the same shape parameters for all coordinates, n...

متن کامل

Average Case Approximation: Convergence and Tractability of Gaussian Kernels

We study the problem of approximating functions of d variables in the average case setting for a separable Banach space Fd equipped with a zero-mean Gaussian measure. The covariance kernel of this Gaussian measure takes the form of a Gaussian that depends on shape parameters γl. We stress that d can be arbitrarily large. Our approximation error is defined in the L2 norm, and we study the minima...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Autonomous Robots

سال: 2008

ISSN: 0929-5593,1573-7527

DOI: 10.1007/s10514-008-9095-6